
feat: Implement Ollama Streaming Chat API with FastAPI #26

Closed · wants to merge 4 commits

Conversation

sourabh-josh (Contributor)

This PR implements a streaming chat API endpoint using FastAPI and an Ollama-served LLM, featuring real-time response streaming with JSON-formatted chunks and a completion signal.

Changes

  • Added streaming chat endpoint with FastAPI
  • Implemented Ollama client integration
  • Added JSON response formatting with completion status
  • Added Ollama server setup script

New Files

  • main.py: FastAPI server implementation with streaming endpoint
  • ollama_client.py: Ollama integration and chat handling
  • ollama.sh: Server setup and initialization script
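Since the PR's source files aren't reproduced in this thread, here is a minimal, illustrative sketch of how the streaming endpoint in main.py could be wired up. It assumes the official `ollama` Python client, newline-delimited JSON chunks with `content`/`done` fields, and the model name `llama3`; all of these are assumptions, not the PR's actual code:

```python
# main.py (illustrative sketch, not the PR's actual implementation)
import json

import ollama  # official Ollama Python client (assumed dependency)
from fastapi import FastAPI
from fastapi.responses import StreamingResponse
from pydantic import BaseModel

app = FastAPI()


class ChatRequest(BaseModel):
    message: str


def stream_chat(message: str):
    """Yield newline-delimited JSON chunks, then a completion signal."""
    stream = ollama.chat(
        model="llama3",  # model name is an assumption
        messages=[{"role": "user", "content": message}],
        stream=True,
    )
    for chunk in stream:
        yield json.dumps({"content": chunk["message"]["content"], "done": False}) + "\n"
    # Final chunk signals completion to the client
    yield json.dumps({"content": "", "done": True}) + "\n"


@app.post("/chat")
def chat(req: ChatRequest):
    return StreamingResponse(stream_chat(req.message), media_type="application/x-ndjson")
```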

Before hitting the endpoint

Pull Ollama:
./ollama.sh pull

Start Ollama:
./ollama.sh start

For more options:
./ollama.sh --help
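The script itself isn't shown in this PR excerpt; purely as a hedged sketch, ollama.sh presumably wraps the stock `ollama` CLI roughly like this (the default model name and the environment variable are assumptions):

```bash
#!/usr/bin/env bash
# Illustrative sketch of ollama.sh; the real script is not shown here.
set -euo pipefail

MODEL="${OLLAMA_MODEL:-llama3}"  # default model is an assumption

case "${1:---help}" in
  pull)  ollama pull "$MODEL" ;;  # fetch model weights
  start) ollama serve ;;          # start the Ollama server (default port 11434)
  *)     echo "usage: $0 {pull|start|--help}" ;;
esac
```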

API Endpoint

  • POST /chat
  • Request Body:
    {
      "message": "string"
    }
    
    
    TODO: add vector embeddings for the input query. See the `ollama_client` method in `/llm/OllamaService.py`; a comment was left there with more details.
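As a usage illustration, a small Python client could consume the stream like this; the host/port and the `content`/`done` response fields follow the sketch above and are assumptions, not documented behavior:

```python
# Illustrative client for POST /chat; URL and field names are assumptions.
import json

import requests

with requests.post(
    "http://localhost:8000/chat",
    json={"message": "Hello!"},
    stream=True,
) as resp:
    resp.raise_for_status()
    for line in resp.iter_lines():
        if not line:
            continue
        chunk = json.loads(line)
        print(chunk["content"], end="", flush=True)
        if chunk.get("done"):
            break
```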

@Selectus2 linked an issue on Nov 26, 2024 that may be closed by this pull request.
@Selectus2 closed this on Dec 5, 2024.
Linked issue: RAG setup for chatbot